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Figure  Captions 


Fig.  1.  Examples  of  high  contrast  stimuli  with  wobble 
present  on  all  three  physical  dimensions.  All 
three  values  used  on  each  of  the  three  dimen- 
sions are  shown  across  the  three  stimuli;  these 
values  are  given  for  Stimuli  1,  H , and  7 
(Stimulus  Set  A)  in  Table  1.  11 

Fig.  2.  The  observed  psychophysical  relations  between  the 
three  physical  dimensions  (AM,  Width,  and 
Location)  and  the  corresponding  INDSCAL-der ived 
perceptual  dimensions  (^  . , ^ ? , and  4*-,),  for 
each  of  the  two  sim il ar ity- j adgment  conditions. 

In  each  case,  the  best-fitting  straight  line 
is  shown  . 16 

Fig.  3-  The  INDSCAL-der ived  coordinates  of  each  of  the 
nine  stimuli  in  Stimulus  Set  A on  perceptual 
dimension  3 plotted  against  coordinates  of  the 
stimuli  on  perceptual  dimension  2.  Stimuli 
sharing  the  same  FI  (F2)  value  are  connected  by 
solid  (dashed)  lines.  17 

Fig.  H.  Relative  dimension  salience  weights  from  INDSCAL 
for  each  of  the  three  perceptual  dimensions 
derived  from  INDSCAL  in  the  two  conditions  of  the 
sim il ar ity- judgment  task,  for  each  of  the 
six  observers.  20 

Fig.  5.  Distribution  of  response  probability  for  each 

of  the  nine  stimuli  in  Condition  1,  for  three  of 
the  six  observers.  Obtained  distributions  are 
given  by  filled  circles;  predicted  distributions 
are  given  by  open  circles.  25 

Fig.  6.  Left  panels:  The  observed  (filled  ('’>cles) 

and  predicted  (open  circles)  relative  salience 
weights  for  each  of  the  three  perceptual 
dimensions  for  the  final  300-trial  block 
of  Condition  1 and  successive  blocks  of  Condition 
2,  for  each  observer  in  Group  B. 

Right  panels:  The  observed  (filled  circles) 
and  predicted  (open  circles)  probability  of  a 
correct  identification  for  successive  blocks 
of  trials  in  Conditions  1 and  2 for  each  observer 
in  Group  B.  32 
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Table  Captions 


Table  1.  Physical  values  defining  each  stimulus 

on  the  three  primary  dimensions  (AM,  FI,  F 2) 
and  the  two  derived  dimensions 
(Width , Location) . 


Table  2.  Proportion  of  variance  accounted  for 

by  the  model  and  estimated  parameter  values 
for  each  observer. 
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Introduction 

The  process  of  identifying  a complex  visual  or  auditory 
stimulus  from  a set  of  similar  stimuli  requires  both  a 
psychological  representation  for  each  candidate  stimulus  and 
an  appropriate  decision  process  to  govern  the  choice  among 
them.  In  recent  work  (Getty,  Swets,  Swets , and  Green,  in 
press),  we  describe  an  approach  to  understanding  complex 
stimulus  identification  that  involves  two  parts:  (1)  the 
derivation  of  a multidimensional  perceptual  space  for  a set 
of  complex  stimuli  from  the  application  of  a 
multidimensional  scaling  (MDS)  procedure  to  judgments  of 
stimulus  similarity,  and  (2)  the  use  of  a probabilistic 
decision  model  to  predict  the  identification  confusion 
matrix  based  on  the  geometric  structure  of  the  MDS-derived 
perceptual  space.  Using  this  approach  with  a set  of  eight 
visual  spectrograms  of  real  underwater  sounds,  we  were  quite 
successful  in  predicting  the  confusion  matrices  for 
individual  observers,  accounting  for  an  over-all  average  of 
94  percent  of  the  observed  variance  across  several 
conditions  of  both  complete-  and  part ial- ident i f icat ion 
tasks.  In  another  study  that  proposed  a related  type  of 
probabilistic  decision  model  — and  again  using  a MDS 
procedure  to  determine  the  perceptual  space  --  Howard, 
Balias,  and  Burgy  ( 1 97b)  were  able  to  account  for  between  61 
and  96  percent  of  the  variance  in  individual  confusion 
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matrices  obtained  for  a set  of  16  complex  acoustic  patterns. 

In  our  previous  study,  we  found  that  observers  appeared 
to  weight  differentially  information  derived  from  the 
several  perceptual  dimensions  in  such  a way  as  to  maximize 
the  average  probability  of  a correct  identification. 
Similarly,  Howard  et  al.  reported  that  their  model  estimates 
of  dimension  "emphasis"  supported  the  hypothesis  that 
relative  emphasis  was  allocated  to  dimensions  by  listeners 
so  as  to  maximize  the  average  probability  correct. 


In  this 

paper,  we 

pursue  the 

implication  of 

the 

findings  that 

observers 

are  flexible 

in  their  use 

o f 

perceptual  dimensions. 

Spec i f ical 1 y , 

we  suggest 

that 

observers  are  engaged  in  a continuing,  dynamic  process  of 
adaptive  tuning  of  dimension  weights  so  as  to  optimize  a 
particular,  task-dependent  criterion  of  performance.  In  the 
usual  identification  task,  this  criterion  is  either 
explicitly  stated,  or  implicitly  understood,  to  be 
maximizing  the  probability  of  a correct  id ent i f icat ion  . 
Clearly,  however,  changes  in  the  task  could  invoke  other 
criteria,  such  as  maximizing  the  probability  of  correctly 
identifying  some  subset  of  the  stimuli,  or  maximizing  the 
probability  of  correctly  classifying  the  stimuli  into  a 
particular  set  of  categories.  Different  criteria  would 
dictate  different  optimal  patterns  of  relative  dimension 


weights . 
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The  optimal  pattern  of  dimension  weights  is  obviously 
stimulus-dependent  as  well  as  task-dependent.  For  example, 
given  the  optimal  pattern  of  relative  dimension  weights  for 
a particular  set  of  stimuli,  if  we  now  decrease  the  range  of 
stimulus  variation  on  one  of  the  perceptual  dimensions 
towards  zero,  we  may  expect  the  optimal  weight  on  that 
dimension,  relative  to  the  others,  to  likewise  decrease 
towards  zero.  In  the  identification  experiment  that 
follows,  we  made  use  of  this  manipulation  in  several 
different  conditions  to  produce  different  expected  patterns 
of  dimension  weights.  Our  objectives  in  these  conditions 
were  to  determine  whether  or  not  tuning  of  dimension  weights 
occurs  and,  if  so,  whether  or  not  the  optimization  criterion 
seems  to  be  maximization  of  the  probability  of  correct 
identification  . 

We  had  several  other  objectives  in  this  experiment  as 
well.  First,  we  wished  to  test  the  reliability  of  the 
perceptual  space  derived  from  INDSCAL  (Carroll  and  Chang, 
1970;  Carroll,  1972;  Carroll  and  Wish,  197^),  which  is  the 
MDS  procedure  we  applied  to  judgments  of  stimulus 
similarity.  Second,  we  wished  to  test  a decomposabil ity 
assumption  of  the  decision  model.  Finally,  we  wished  to 
obtain  multiple  tests  of  the  model's  adequacy  in  predicting 
individual  confusion  matrices  under  a variety  of  conditions, 
using  a set  of  stimuli  that  were  sufficiently  similar  to 
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yie^d  a high  rate  of  confusion  errors.  Each  of  these  goals 
is  elaborated  at  appropriate  points  in  the  discussion. 


The  Identification  Model 


We  describe  here,  briefly,  the  decision  model  which 
predicts  the  distribution  of  response  probabilities  for  each 
stimulus,  given  the  relative  loci  of  the  stimuli  in  the 
observer's  multidimensional  perceptual  space.  (For  a fuller 
discussion  of  the  model,  see  Getty  et  al . , in  press). 


We  assume  that  the  distance  between  stimulus  and 


stimulus  Sj  in  the  perceptual  space  is  given  by  a weighted 


Euclidean  metric 


D . 


i »J 


k wk  ( ^i  ,k  “ ,k} 


(1  ) 


where  w^  is  a salience  weight  measuring  the  relative 


importance  of  dimension  k ( w^  0 and  £wk  = 1),  and  ^ ^ is 


the  coordinate  value  of  stimulus  on  dimension  k.  The 


salience  weights  are  model  parameters;  the  stimulus 
coordinates  are  obtained  from  the  MDS  procedure  applied  to 
judgments  of  stimulus  similarity. 


We  define  a set  of  confusion  weights  C.  . such  that  the 

1 * J 


con fusab il ity  of  stimulus  with  stimulus  S j is  given  by 


-H- 


>i4r 
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C.  . = exp  (-a  D.  .)  (2) 

1 » J 1 > J 

where  a is  an  observer  sensitivity  parameter,  greater  than 

zero,  and  therefore  C.  . is  bounded  between  0 and  1.  This 

1 > J 

exponential  relationship  has  received  support  both  in  our 
earlier  experiments  and  in  Shepard's  related  work  on 
stimulus  generalization  (1957,  1958a,  1958b). 

Finally,  the  conditional  probability  of  giving  the 
response  assigned  to  stimulus  , R ^ , when  stimulus  was 
presented  is  given  by  Luce's  biased  choice  model  (1963): 


Pr(R  . |Si) 


b . C . . 

_J i|J. 

£ bkCi,k 


(3) 


where  b^  is  a response  bias  weight  describing  the  relative 
bias  towards  making  response  R ^ , ( b ^ 0 and  Tb^  = 1)  and  k 
is  an  index  over  the  set  of  stimuli. 


Method 


Apparatus 

All  experimental  events  were  controlled  by  a DEC 
PDP-11/39  minicomputer.  The  visual  stimuli  were  constructed 
on  a COMTAL  8000-SA  image- process i ng  system  and  displayed 
within  a 512  x 512  pixel  (picture  element)  matrix  which 


• • >■ 


— 

. 
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filled  a 24-cm  square  area  on  a CONRAC  17-inch  (43-cm)  SNA 
television  monitor.  The  monitor  brightness  and  contrast 
were  adjusted  such  that  middle  gray  (128  gray  units  on  the 

p 

COMTAL  scale)  had  a luminance  of  about  46  cd/m  , and  full 

2 

white  (255  gray  units)  a luminance  of  about  480  cd/m  . 
Ambient  room  lighting  was  maintained  at  a dim  level. 

In  both  the  simil ar ity- judgment  and  identification 
tasks,  three  observers  (one  group)  sat  at  one  time  at 
individual  video  terminals  (Lear  Siegler  ADM-3A) 
approximately  2 m from  the  stimulus-display  monitor,  whose 
center  was  about  1.1  m above  the  floor. 

Stimuli 

Our  stimuli  were  synthesized  visual  spectrograms  of 
complex,  two-formant  sounds.  The  stimuli  were  displayed  as 
frequency  (horizontal  axis)  versus  time  (vertical  axis) 
versus  energy  (darkness  — the  greater  the  energy,  the 
darker  the  trace).  They  varied  along  three  physical 
dimensions:  (1)  location  of  the  center  of  the  first 

formant,  (2)  location  of  the  center  of  the  second  formant, 
and  (3)  the  frequency  of  sinusoidal  amplitude-modulation 
(AM)  in  the  temporal  direction  (corresponding  to  a 
low-frequency  periodicity  in  the  sound). 
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A basic  set  of  27  stimuli  was  defined  by  all 
combinations  of  three  equally-spaced  values  on  each  of  the 
three  dimensions.  The  values  for  the  location  of  the  first 
formant  were  20,  25,  and  30  elements  from  the  left  edge  of 
the  stimulus  image.  (The  stimulus  was  126  elements  wide  by 
128  elements  high.)  The  three  values  for  chp  location  of 
the  second  formant  were  90,  95,  and  100  elements  from  the 
left  edge.  The  three  frequencies  of  amplitude  modulation 
were  15,  17,  and  19  cycles  per  stimulus.  The  27  stimuli 
were  "randomly"  partitioned  into  three  sets  of  nine  stimuli, 
subject  to  the  following  constraints:  (1)  no  two  stimuli 
within  a set  shared  more  than  one  coordinate  value  in 
common;  (2)  for  each  set,  each  of  the  three  values  on  each 
of  the  three  dimensions  occurred  with  exactly  three  of  the 
stimuli.  The  purpose  of  these  constraints  was  to  obtain  a 
uniform  distribution  of  stimuli  throughout  the  space  within 
each  set.  The  resulting  three  sets,  labelled  A,  B,  and  C, 
are  given  in  Table  1. 


The  stimulus  image  was  constructed  by  subtracting 
profiles  of  the  two  formants  from  a background  of  random, 
Gauss i an-d  i str ibuted  noise.  The  noise  consisted  of  a 
126  x 128  (w  x h)  matrix  of  elements,  each  element  having  an 
independent  gray  value  sampled  anew  on  each  trial  from  a 
Gaussian  distribution  with  a mean  of  128  gray  units  and  a 
standard  deviation  of  25  gray  units.  The  first  and  second 
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Table  1 

Physical  values  defining  each  stimulus  on  the 
three  primary  dimensions  (AM,  FI,  F2)  and  the 
two  derived  dimensions  (Width,  Location) 


Stimulus 

AM 

FI 

F2 

Width3  Location*5 

Number 

Stimulus  Set  A 


1 

17 

30 

100 

70 

65.0 

2 

17 

25 

90 

65 

57.5 

3 

19 

25 

100 

75 

62.5 

4 

15 

25 

95 

70 

60.0 

5 

1 5 

20 

100 

80 

60.0 

6 

1 7 

20 

95 

75 

57.5 

7 

19 

20 

90 

70 

55. C 

8 

15 

30 

90 

60 

60.0 

9 

19 

30 

95 

65 

62.5 

Stimulus  Set  8 

1 

1 5 

20 

95 

7 5 

57.5 

2 

15 

25 

90 

65 

57.5 

3 

1 5 

30 

100 

70 

65.0 

4 

I 7 

20 

90 

70 

55.0 

5 

1 7 

25 

100 

75 

62  . 5 

6 

1 7 

30 

95 

65 

62.5 

7 

1 9 

20 

100 

80 

60.0 

8 

1 9 

25 

95 

70 

60.0 

9 

19 

30 

90 

60 

60.0 

Stimulus  Set  C 

1 

1 5 

20 

90 

70 

55.0 

2 

1 5 

30 

95 

65 

62.5 

3 

19 

20 

95 

75 

57.5 

4 

1 7 

30 

90 

60 

60.0 

5 

1 5 

25 

100 

75 

62.5 

6 

1 7 

25 

95 

70 

60.0 

7 

19 

30 

100 

70 

65.0 

8 

1 7 

20 

100 

80 

60.0 

9 

19 

25 

90 

65 

57.5 

a Width  « F2  - FI 
0 L oca t ion  * (FI  ♦ F2)/2 
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formants  were  given  Gauss i an- shaped  horizontal  profiles  with 
a peak  amplitude  of  40  gray  units  and  standard  deviations  of 
10  and  18  pixels,  respectively.  The  amplitude  modulation 
was  imposed  on  the  resulting  image  in  the  vertical  direction 
with  a modulation  depth  of  50  percent.  Finally,  each 
element  was  expanded  to  fill  a 2 x 2 pixel  area,  resulting 
in  an  image  252  x 256  pixels  on  the  display  monitor. 

These  high-contrast  stimuli,  were  used  in  the 
similarity-judgment  task.  In  order  to  increase 
confusabil ity  of  the  stimulus  set  for  the  identification 
task,  and  to  increase  their  realism,  the  stimuli  were 
modified  in  two  ways.  First,  we  introduced  temporal 
variability  or  "wobble"  into  the  displayed  value  of  the 
stimulus  on  each  of  the  three  dimensions.  Thus,  scanning 
down  each  of  the  examples  shown  in  Fig.  1,  you  will  see 
(independent)  wobble  in  the  location  of  each  of  the  two 
formants  and  in  the  period  of  the  amplitude  modulation.  For 
each  dimension,  the  temporal  pattern  of  deviation  from  the 
value  specified  for  the  stimulus  was  generated  by  summing 
together  four  sinusoids  with  frequencies  1,  2,  3,  and  4 
cycles/stimulus,  each  beginning  at  a different  random  phase 
that  was  sampled  anew  on  each  trial.  By  summing  only 
integral  frequencies,  we  guaranteed  that  the  total  deviation 
about  the  specified  dimension  value,  integrated  down  the 
length  of  a stimulus  image,  was  always  zero.  As  the  second 

-9- 


Report  No.  3930 


Bolt  Beranek  and  Newman  Inc. 


modification  to  increase  confusabil ity  we  reduced  the 
signal-to-noise  ratio  considerably  by  decreasing  the  peak 
amplitude  of  the  first  and  second  formants  from  40  gray 
units  to  8 and  10,  respectively.  As  a result,  the  formants 
were  barely  perceptible  within  the  stimulus. 

Observer  s 

Two  groups  of  three  observers,  referred  to  as  Groups  A 
and  B,  participated  in  the  experiment.  Four  of  the  six 
observers  were  recruited  from  an  observer  pool  and  paid  for 
their  assistance;  the  other  two  were  members  of  BBN ' s 
technical  staff,  including  one  of  the  experimenters  (JBS). 

Proced  ur  e 

Similarity-Judgment  Task.  The  observers  were 
instructed  to  rate  similarity  of  pairs  of  stimuli  on  a 
7-point  scale,  with  the  scale  endpoints  "1"  and  "7" 
indicating  very  dissimilar  and  very  similar  stimuli, 
respectively.  Each  of  the  36  possible  pairs  for  a 
particular  set  of  nine  high-contrast,  wobbleless  stimuli  was 
presented  side-by-side  on  the  display  monitor  for  15 
seconds,  followed  Dy  an  ob server- paced  response  interval. 
Four  blocks  of  36  trials  were  presented,  using  a different 
random  order  for  each  block,  and  with  the  left-right 
position  of  stimuli  within  a given  pair  counterbalanced  ->ver 
bloc ks  . 


-10- 


Examples  of  high  contrast  stimuli  with  wobble 
present  on  all  three  physical  dimensions.  All 
three  values  used  on  each  of  the  three  dimen- 
sions are  shown  across  the  three  stimuli;  these 
values  are  given  for  Stimuli  1,  4,  and  7 
(Stimulus  Set  A)  in  Table  1. 
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All 

six 

observers  participated 

in  two 

d i f ferent 

cond it  ions 

of 

the  sim il ar i ty- judgment 

task . 

The 

two 

cond itions 

were 

run  successively,  between  the 

first 

and 

second  conditions  of  the  identification  task. 

Identification  Task.  Each  trial  began  by  blanking  of 
the  display  monitor  screen,  followed  approximately  2.5 
seconds  later  by  a low-contrast,  wobbly  display  of  a 
stimulus  image,  chosen  randomly  from  the  set  of  nine  being 
used.  Each  observer  then  made  a self-paced  identification 
response  (from  the  numbers  "1"  to  "9"  on  the  keyboard), 
making  reference  to  a folder  of  labelled,  high-contrast, 
wobbleless  Polaroid  photographs  of  the  nine  stimuli.  After 
all  observers  had  responded,  the  number  of  the  presented 
signal  was  displayed  on  each  observer's  terminal.  The 
stimulus  number  and  stimulus  image  remained  on  display  for 
for  approximately  two  seconds  after  which  the  next  trial 
began.  Trials  were  grouped  into  50-triai  blocks,  with  three 
blocks  presented  each  day  for  seven  days  for  each  of  three 
conditions  of  the  task  for  each  group.  The  nature  of  each 
condition  for  each  group  is  discussed  later  as  each  arises. 


Derivation  of  the  Perceptual  Space 

The  primary  purpose  of  the  first  similarity-judgment 
condition  was  to  derive  a multidimensional  representation  of 
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the  observers'  perceptual  space  in  terms  of  both  the 
identity  of  the  psychological  dimensions  comprising  the 
space  and  the  relative  loci  of  the  stimuli  within  the 
multidimensional  space.  Stimulus  Set  A (as  defined  in  Table 
1)  was  used  in  this  condition  — the  same  set  of  nine 
stimuli  used  for  all  observers  in  the  immediately  preceding 
identification  condition.  It  was  hoped  that  by  running  the 
similarity-judgment  task  immediately  after  a condition  of 
the  identification  task,  and  by  using  the  same  stimulus  set 
in  both  cases,  that  the  same  perceptual  dimensions  salient 
to  the  observers  in  the  identification  task  would  remain 
salient  in  the  similarity-judgment  task.  Our  concern 
stemmed  from  a previous  study  (Getty  et  al . , in  press)  in 
which  one  dimension  (low-frequency  periodicity  or  amplitude 
modulation)  that  was  apparently  utilized  by  observers  when 
identifying  the  stimuli  was  not  used  when  they  rated 
similarity  of  the  stimuli.  In  anticipation  of  our  present 
results,  that  problem  did  not  arise  here. 

The  objectives  of  the  second  sim  i 1 ar ity- judgment 
condition  were  to  provide  a test  of  the  reliability  of  the 
INDSCAL  procedure,  and  to  test  the  decomposabil ity  property 
of  INDSCAL.  Decomposabil ity  is  the  assumption  that  the 
distance  between  any  two  points  in  the  derived 
multidimensional  space  can  be  decomposed  into  independent 
contributions  from  each  of  the  derived  dimensions  (see 
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Tversky  and  Krantz,  1970).  Stimulus  Set  B,  the  stimuli  used 
in  this  condition  and  defined  in  Table  1,  shared  with 
Stimulus  Set  A the  same  three  values  on  each  of  the  three 
physical  dimensions,  although  in  different,  unique 
combinations.  Given  that  these  new  stimuli  lay  in  the  same 
region  of  the  physical  space,  we  would  expect  to  derive  the 
same  perceptual  dimensions  from  INDSCAL  in  the  second 
condition  as  in  the  first  condition,  and  to  find  the  same 
psychophysical  mapping  of  each  physical  dimension  into  its 
corresponding  psychological  dimension.  This  latter 
expectation  requires,  in  part,  that  decom posab il ity  be 
satisfied  . 

Results 

We  calculated  the  average  similarity  rating  for  each  of 
the  3^  stimulus  pairs,  separately  for  each  observer  and  each 
condition.  The  average  for  each  pair  was  based  on  judgments 
from  the  second,  third,  and  fourth  blocks  of  trials;  the 
first  block  of  trials  was  regarded  as  practice  and  excluded 
from  analysis.  The  average  ratings  for  each  observer  were 
then  submitted  to  MDS  analysis,  separately  for  the  two 
conditions,  using  the  metric  version  of  INDSCAL. 

Condition  1.  The  rating  data  for  stimulus  Set  A were 
fitted  well  by  a three-dimensional  solution  which  accounted 
for  89  percent  of  the  variance  in  the  data.  The  first 
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psychological  dimension  is  cleariy  identified  with  the 
physical  AM  dimension.  The  psychophysical  relationship, 
shown  in  the  top,  left  panel  of  Fig.  2,  appears  to  be  highly 
linear,  at  least  over  the  physical  range  utilized.  The 
best-fitting  straight  line  accounts  for  99  percent  of  the 
observed  variability  in  the  psychological  dimension. 

Surprisingly,  the  identities  of  the  other  two 
psychological  dimensions  are  not  the  locations  of  the  first 
and  second  formants,  those  being  the  other  two  physical 
dimensions  manipulated  in  the  construction  of  the  stimuli. 
Rather,  they  appear  to  be  identified  with  a related  pair  of 
physical  dimensions  obtained  by  a ^-degree  rotation  of  the 
FI  and  F2  axes.  This  geometric  relationship  can  be  seen  in 
Fig.  3 which  shows  the  stimuli  plotted  in  the  ip  ^ , ip  ^ plane. 
Stimuli  sharing  the  same  value  of  FI  are  connected  by  solid 
lines;  those  sharing  the  same  value  of  F 2 are  connected  by 
dashed  lines.  A possible  basis  for  the  obtained 
psychological  dimensions  becomes  apparent  if  one  regards 
each  stimulus  as  being  composed  of  a central  vertical  column 
bounded  on  either  side  by  FI  and  F2  (see  Fig.  1).  Then  the 
physical  dimension  corresponding  to  ip  ^ is  the  width  of  the 
column  — the  distance  between  FI  and  F2  — and  the  physical 
dimension  corresponding  to  i|^  is  the  location  of  the  column, 
that  is,  the  location  of  the  midpoint  between  FI  and  F2, 
given  by  (FI  ♦ F2)/2. 


■ - - — - - 
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Fig.  2.  The  observed  psychophysical  relations  between  the 
three  physical  dimensions  (AM,  Width,  and 
Location)  and  the  corresponding  INDSCAL-der ived 
perceptual  dimensions  ( ip  , ij>_,  and  tjO  , for 
each  of  the  two  sim il ar ity- j udgment  conditions. 

In  each  case,  the  best-fitting  straight  line 
is  shown  . 
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COORDINATE 


The  INDSCAL-der i ved  coordinates  of  each  of  the 
nine  stimuli  in  Stimulus  Set  A on  perceptual 
dimension  3 plotted  against  coordinates  of  the 
stimuli  on  perceptual  dimension  2.  Stimuli 
sharing  the  samp  FI  (F2)  value  are  connected  by 
solid  (dashed)  lines. 
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Plotting  the  perceptual  coordinates  of  the  stimuli  on 
perceptual  dimensions  2 and  3 against  the  physical  measures 
of  Width  and  Location,  respectively,  shown  in  the  righthand, 
top  two  panels  of  Fig.  2,  we  see  that  both  psychophysical 
relations  are  quite  linear.  The  best-fitting  lines  account 
for  97  and  9b  percents  of  the  variance  in  tp ^ and  , 
respectively . 

Condition  2.  The  rating  data  for  stimulus  Set  B were 
fitted  well  by  a three-dimensional  INDSCAL  solution  which 
accounted  for  86  percent  of  the  variance  in  the  data,  nearly 
the  same  value  obtained  in  Condition  1. 

Correlating  the  three  perceptual  dimensions  with  the 
physical  dimensions  determined  in  Condition  1,  we  found  a 
correspondence  between  and  Width,  between  and  AM,  and 
between  <1^  and  Location.  As  in  Condition  1,  the 
psychophysical  relations,  shown  in  the  lower  row  of  Fig.  2, 
are  all  quite  linear.  The  best-fitting  straight  lines 
account  for  89,  99,  and  87  percents  of  the  variance  on 
the  ^ , t^,  and  ip ^ dimensions  respectively. 

We  are  thus  able  to  account  for  the  similarity  rating 
of  two  independent  sets  of  stimuli  drawn  from  a common 
physical  space  in  terms  of  the  same  three  perceptual 
dimensions,  although  the  ordering  of  AM  and  Width  dimensions 
is  reversed  in  the  INDSCAL  solution  of  the  second  condition. 
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Moreover,  both  sets  of  data  support  the  conclusion  that  the 
perceptual  dimensions  are  linearly  related  to  their 
respective  physical  dimensions. 

Salience  Weights.  In  addition  to  the  coordinates  for 
each  stimulus  in  the  multidimensional  space,  TNDSCAL  also 
produces  a vector  of  salience  weights  for  each  observer 
reflecting  the  relative  importance  of  each  dimension  for 
that  observer.  The  derived  salience  weights  for  each 
observer  in  both  conditions  of  the  task  are  shown  in  Fig.  4. 
The  results  indicate  that  there  are  substantial  individual 
differences  in  the  pattern  of  salience  weights  across 
observers,  but  that  there  is  considerable  stability  in  the 
pattern  of  weights  for  a given  individual  across  the  two 
cond  itions  . 

Prediction  of  Identification  Confusion  Matrices 

The  first  condition  of  the  identification  task  was  the 
same  for  both  groups  of  observers,  and  made  use  of  the  nine 
patterns  comprising  Stimulus  Set  A.  It  provided  a first 
test  of  the  model's  accuracy  in  predicting  confusion 
matrices  and  served  as  a reference  for  the  other  conditions. 

Prior  to  starting  this  condition,  all  observers 
received  six  sessions  of  practice  (150  tr i al s/ sess  ion ) 
identifying  high-contrast , wobbleless  stimuli,  and  one  day 
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of  practice  identifying  low-contrast,  wobbly  stimuli. 
Identification  data  were  then  collected  for  seven  days;  the 
resulting  confusion  matrix  for  each  observer  contained  1050 
trials. 


Model  Anal ys i s 


Parameter  Es tima tji l_oj n . Model  parameters  were  estimated 
separately  for  each  observer  in  a two-stage  process.  First, 
an  iterative  parameter-estimation  procedure  was  used  to 
determine  the  set  of  response  bias  weights,  b ^ , that 
minimized  the  summed  squared  deviation  between  obtained  and 
predicted  confusion  matrices  (as  given  by  Eq . 3).  The 
response  bias  weights  were  then  fixed  at  their  estimated 
values  and  a second  estimation  procedure  carried  out  using 
the  fuli  structure  of  the  model  (Eqs.  1,  2,  and  3)  and  the 
normalized  three-d imens ional  loci  of  the  stimuli  obtained 
from  the  TNDSCAL  analysis  of  the  sim  ilar ity-judgment  task. 
Estimates  were  obtained  for  the  three  salience  weights,  w^, 
w0,  and  w ^ , and  the  sensitivity  parameter,  a.* 

Predict  ion  of  In d i v id u al  Confusion  Matrices . The  model 
predicts  the  individual  confusion  matrices  very  well, 

T we  e s t frn a t ed  ~the  d imensrorf  ~sal  ience  weights  Instead  of 
using  the  INDSCAL-der i ved  weights  for  each  observer  because 
the  requirements  of  the  identification  task  may  well  result 
in  salience  weights  different  than  those  of  the 
s lm i 1 ar i t y - j udgment  task. 
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accounting  for  an  average  of  92  percent  of  the  variance 
across  the  six  observers.  Moreover,  we  did  not  find  any 
obvious  violations  of  model  assumptions  in  the  remaining 
variance  not  accounted  for.  The  goodness-of-fit  and  the 
estimated  model  parameter  values  are  shown  in  Table  2 
(Condition  1)  for  each  observer.  The  relative  importance  of 
the  three  dimensions  appears  to  vary  to  some  extent  across 
observers;  however,  there  is  a general  tendency  for  Width 
and  Location  to  be  weighted  about  equally,  and  both  to  be 
weighted  more  heavily  than  AM. 

The  predicted  and  obtained  response  distributions  for 
each  stimulus  are  shown  in  Fig.  5 for  three  of  the  six 
observers,  chosen  to  represent  the  worst  (DT),  median  (ST), 
and  best  (DW)  fits  of  the  model  to  the  data.  The  over-all 
probability  of  a correct  identification  was  0.50,  averaged 
over  stimuli  and  observers.  Thus,  about  half  of  the 
approximately  100  responses  contributing  to  each 
distribution  were  confusion  errors.  In  Fig.  5,  it  can  be 
seen  that  the  model  generally  predicts  with  considerable 
accuracy  both  the  probability  of  a correct 
identification  — the  highest  peak  in  most 
distributions  --  and  the  patterns  of  confusion  errors.  Even 
in  cases  where  the  prediction  of  the  exact  magnitude  of  a 
major  confusion  response  is  not  accurate  --  for  example, 
Response  3 to  Stimulus  1 for  observer  ST  — the  model  has  at 


Report  No.  3930 


Bolt  Beranek  and  Newman  Inc 


Table  2 


Proportion  of 

variance  accounted  for  by 

the  model 

and  estimated 

parameter  values 

for  each 

observer 

2 

r 

W1 

w2 

w 3 

i 

OB 

(Amp.  Mod.) 

(Width) 

(Loc. ) 

CONDITION  1 

GROUP  A 

BS 

.95 

.07 

.46 

.47 

5.9 

ST 

.91 

.00 

.45 

.55 

4.5 

DW 

.98 

.43 

.34 

.24 

6.1 

575 

.95 

.17 

742 

.42 

T3 

GROUP  B 

JK 

.86 

.23 

.38 

.38 

4.3 

JS 

.97 

.22 

.45 

.34 

5.4 

DT 

.83 

.12 

.47 

.41 

3.6 

AVG 

T9 

.43 

.38 

4.4 

COND.  AVG. 

.92 

.18 

.43 

.40 

5.0 

CONDITION  2 

GROUP  A 

BS 

.96 

.09 

.31 

.61 

5.8 

ST 

.92 

.00 

.44 

.56 

5.1 

DW 

.99 

.36 

.34 

.31 

7.3 
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least  successfully  predicted  the  response  to  be  a major 
confuser  . 

One  can  also  see  in  Fig.  5 that  while  the  patterns  of 
confusion  errors  for  a particular  stimulus  are  often  similar 
across  observers,  there  are  some  differences  --  occasionally 
large.  The  model  accounts  for  these  individual  differences 
based  on  a different  pattern  of  dimension  salience  weights 
for  each  observer. 

A Test  of  Dimensional  Decom posabil ity 

The  decision  model  assumes  that  the  measure  of 
interstimulus  distance  is  decomposable  into  independent 
contributions  from  each  of  the  perceptual  dimensions 
(Eq.  1).  In  the  first  condition  of  the  identification  task 
just  described,  the  stimuli  were  the  same  as  those  used  in 
the  sim il ar ity- judgment  task  from  which  the  perceptual 
representation  of  the  stimuli  was  derived.  Our  objective  in 
a second  condition  of  the  identification  task  was  to  test 
the  decomposab il ity  assumption  using  the  perceptual  space 
derived  from  one  set  of  nine  stimuli  (Stimulus  Set  A in 
Table  1)  to  predict  the  confusions  for  an  independent  set  of 
nine  stimuli  (Stimulus  Set  C)  . These  new  stimuli  share  with 
the  old  set  the  same  three  values  on  each  of  the  three 
dimensions,  but  in  new,  independent  combinations.  If  each 
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dimension  contributes  independently  to  interstimulus 
distance,  then  we  should  do  as  well  at  predicting  the 
confusion  matrix  for  Stimulus  Set  C as  for  Stimulus  Set  A, 
when  both  predictions  are  based  on  the  perceptual  space 
derived  from  Stimulus  Set  A.  If,  on  the  other  hand, 
decomposabil ity  is  not  satisfied,  then  we  would  predict 
response  confusions  less  well  in  this  condition  than  in  the 
1 ast . 


The  three  observers  of  Group  A were  run  in  this 
condition  for  seven  sessions.  The  data  from  the  first 
session  were  regarded  as  practice  and  excluded  from 
analysis;  the  900  trials  from  the  other  six  sessions  were 
combined  for  each  observer  into  a confusion  matrix.  The 
model  was  then  fitted  to  each  observer's  data  using  the 
two-stage  procedure  described  previously. 

The  results  are  shown  in  Table  2 under  Condition  2, 
Group  A.  The  model  accou  ted  for  an  average  of  96  percent 
of  the  variance  in  the  observed  confusion  matrices,  a figure 
almost  identical  to  the  95  percent  of  the  variance  accounted 
for  in  the  first  condition  for  these  same  observers. 
Moreover,  the  percentages  of  variance  accounted  for  in  the 
two  conditions  agree  within  one  percent  for  each  of  the 
three  observers  individually.  This  result  offers  strong 
support  for  the  model’s  assumption  of  dimensional 
decomposability. 
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Adaptive  Tuning  of  Dimension  Salience  Weights 

In  this  section,  we  investigate  the  flexibility  shown 
by  observers  in  adjusting  or  "tuning"  the  distribution  of 
salience  weights  in  response  to  changes  in  the  set  of 
stimuli  to  be  identified.  In  particular,  we  hypothesize 
that  for  a given  set  of  stimuli,  the  observer  engages  in  an 
adaptive  process  of  adjusting  the  relative  salience  weights 
over  the  set  of  perceptual  dimensions  so  as  to  maximize  the 
probability  of  a correct  identification.  In  terms  of  the 
model,  the  optimal  distribution  of  salience  weights  will 
clearly  vary  as  a function  of  the  relative  average 
interstimul us  spacing  of  the  stimuli  on  each  of  the 
dimensions.  With  the  loci  on  all  other  dimensions  fixed, 
the  optimal  relative  salience  of  a particular  dimension  will 
decrease  towards  zero  as  the  average  inter st imul us  spacing 
on  that  dimension  decreases  towards  zero. 

In  each  of  three  conditions,  we  made  one  of  three 
physical  dimensions  --  Width,  FI  location,  and  AM  — less 
useful  to  the  observer  by  "squeezing"  the  values  on  the 
dimension  closer  together  than  they  were  in  the  reference 
condition  (Condition  1).  Our  expectation  --  and  model 
prediction  — is  that,  in  each  case,  the  salience  of  the 
compressed  dimension  will  decrease  relative  to  its  value  in 
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the  reference  condition^  We  discuss  here  the  results  only 
for  the  condition  in  which  we  compressed  the  Width  dimension 
(Group  B,  Condition  2).  The  results  for  the  other  two 
conditions,  in  which  we  compressed  FI  location  (Group  B, 
Condition  3)  and  AM  values  (Group  A,  Condition  3),  showed 
effects  similar  in  type,  but  less  pronounced  in  magnitude 
due  to  the  fact  that  we  compressed  the  FI  location  and  AM 
dimensions  less  than  we  compressed  Width.  The  fits  of  the 
model  and  parameter  estimates  for  those  other  two  conditions 
are  included,  however,  in  Table  2. 

The  stimuli  in  the  compressed  Width  condition  were 
identical  to  those  in  Stimulus  Set  A,  except  that  the  five 
values  of  Width  represented  among  the  nine  stimuli  were 
compressed  from  60,  6S , 70,  76,  and  80  elements  to  68,  69, 
70,  71,  and  72  elements,  respectively  — a five-fold 
reduction  in  the  range  on  this  dimension. 

A confusion  matrix  was  obtained  for  each  of  the  three 
observers  in  Group  B,  based  on  900  trials  from  the  last  six 


*Another  plausible  intuition  suggests  that  as  the 
d i scrim inab il ity  on  a dimension  is  decreased  by  compression, 
the  observer  should  compensate  by  devoting  more  attention  to 
that  dimension.  The  model  does  not  support  this  prediction 
because  the  total  amount  of  attention  to  be  distributed  over 
dimensions  is  fixed  (at  unity  in  the  model).  Thus,  an 
increase  in  the  salience  of  a dimension  of  reduced  utility 
is  achieved  only  at_  the  ex  pense  of  a decrease  in  the 
salience  of  some  other  dimension  of  relatively  enhanced 
util ity. 
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of  seven  sessions;  the  first  session  was  excluded  as 
practice.  Compressing  Width  had  a marked  effect  on  the 
over-all  probability  of  a correct  identification,  reducing 
it  from  .45  for  this  group  in  Condition  1 to  .29  in  this 
condition . 


In  order  to  fit  the  model  to  the  observed  confusion 
matrices,  estimates  were  required  of  the  perceptual 
locations  corresponding  to  the  new  physical  Width  values. 
These  were  obtained  from  the  linear  psychophysical  function 
relating  the  physical  and  perceptual  Width  dimensions 
derived  from  the  sim  ilarity- judgment  task  (see  Fig.  2).  The 
resulting  goodness-of- f it  values  and  parameter  estimates  are 
given  in  Table  2 (Condition  2,  Group  B)  for  each  observer. 
For  two  of  the  observers  (JK  and  JS),  the  salience  weight 
for  Width  dropped  markedly,  as  expected,  from  .38  and  .45  in 
Condition  1 to  .08  and  .11  in  Condtion  2,  respectively.  For 
the  third  observer  (DT),  however,  the  Width  salience  weight 
increased  slightly  from  .47  to  .52,  contrary  to  our 
expectation  based  on  adaptive  tuning. 


A further  analysis  aids  our  understanding  of  these 
results  considerably.  In  order  to  determine  the  temporal 
course  of  tuning,  we  reanalyzed  the  Group  B data  of  both 
Conditions  1 and  2 in  successive  300-trial  blocks  (thirds  of 
a condition)  and  fitted  the  model  separately  to  each  of 
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these  successive  blocks.  The  estimated  salience  weights  for 
the  final  300-trial  block  of  Condition  1 and  each  successive 
block  of  Condition  2 are  shown  for  each  observer  by  the 
solid  lines  in  the  left-hand  panels  of  Fig.  6.  In  addition, 
for  a particular  set  of  stimuli  — and  given  the  obtained 
estimates  of  an  observer's  response  biases  and 
sensitivity  — it  is  possible  to  determine  from  the  model 
the  pattern  of  salience  weights  that  would  maximize  the 
observer's  probability  of  a correct  identification.  We 
determined  these  optimal  weights,  shown  in  Fig.  6 by  the 
dashed  lines,  for  each  successive  block,  for  each  observer. 

Comparing  observed  and  optimal  salience  weight  patterns 
we  see  that  observers  JK  and  JS  demonstrate  relatively 


optimal  weight 

patterns 

in 

the 

final  blocks  both 

of 

Conditions  1 

and  2 . 

The 

optimal  weight 

pattern 

for 

Condition  2 is 

quite  different 

than 

that  for 

Condition 

1 . 

Both  observers  JK  and  JS  redistribute  their  salience  weights 
to  match  the  new  optimal  patterns  by  the  second  block  of 
300-trials.  The  tuning  process  is  sufficiently  gradual, 
however,  so  that  the  observed  pattern  for  both  observers  for 
the  first  300  trial  block  of  Condition  2 is  intermediate 
between  that  present  at  the  end  of  Condition  1 and  that 
arrived  at  by  the  middle  of  Condition  2. 
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In  contrast,  a comparison  of  the  obtained  and  optimal 
salience  weight  patterns  for  observer  DT  shows  little 
evidence  of  tuning.  Over  successive  thirds,  the  observed 
weight  pattern  changes  considerably,  but  in  ways  which 
appear  to  bear  little  relationship  to  the  optimal  weight 
pattern.  A possible  explanation  of  this  result  becomes 
clear  when  we  compare  the  probability  of  a correct 
identification,  Pr(C),  obtained  when  the  optimal  weight 
pattern  is  used  with  that  obtained  when  the  observed, 
non-optima1  weight  pattern  is  used.  These  optimal  and 
observed  correct  identification  probabilities  are  shown  for 
each  successive  300-trial  block  in  the  right-hand  panels  of 
Fig.  6.  Not  sur pr isingl y , the  observed  Pr(C)  is  no  more 
than  a few  percent  below  the  optimal  value  in  all  cases  for 
observers  JK  and  JS,  both  of  whom  adopted  nearly  optimal 
weight  patterns.  What  is  surprising,  however,  is  that  the 
observed  Pr(C)  for  observer  DT  is,  on  the  average,  no  more 
than  6 percent  below  the  optimal  value.  For  our  stimuli, 
the  optimization  surface  determined  by  the  three  salience 
weights  is  very  flat,  so  that  relatively  large  departures  in 
the  weights  from  the  optimal  pattern  result  in  only  a modest 
decrement  in  identification  accuracy.  According  to  this 
reasoning,  we  may  interpret  the  performance  of  observers  JK 
and  JS  as  indicating  a high  sensitivity  to  small  deviations 
from  optimal  Pr(C)  while  the  performance  of  observer  DT 
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indicates  a lesser  sensitivity.  We  speculate  that  if 
stimuli  were  chosen  in  such  a way  that  the  optimization 
surface  were  steep,  we  would  find  evidence  of  tuning  in  the 
performance  of  all  observers,  including  that  of  observer  DT. 

Conclusions 

Our  results  demonstrate  the  existence  of  an  adaptive 
process  of  tuning  of  dimension  salience  weights  in  the 
identification  process.  For  the  identification  task,  the 
data  are  consistent  with  the  hypothesis  that  observers  tune 
to  maximize  the  average  probability  of  correct 
identification.  Starting  from  a very  non-optimal  weight 
distribution,  we  observed  that  the  tuning  process  may  take 
place  over  tens  or  hundreds  of  trials.  These  results 
support  the  view  that  the  human  observer  is  indeed  flexible 
in  his  or  her  use  of  perceptual  dimensions  in  pattern 
identification  or  classification,  adjusting  the  weighting  of 
dimensions  to  the  characteristics  of  the  set  of  stimuli  at 
hand  and  the  particular  requirements  of  the  identification 
or  classification  task. 

The  results  of  the  two  conditions  of  the 
sim il ar ity- j udgment  task  affirm  the  reliability  of  the 
INDSCAL  MDS  procedure  in  that  we  derived  the  same  three 
perceptual  dimensions,  and  linear  psychophysical  functions 
on  all  three  dimensions,  in  both  conditions. 
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We  also  obtained  support  for  the  model's  assumption  of 
dimension  decomposabil ity . Having  obtained  estimates  of  the 
perceptual  loci  for  one  particular  set  of  stimuli,  we  were 
then  able  to  predict  the  confusion  matrix  equally  well  for 
several  different  combinations  of  the  same  set  of  values  on 
the  three  dimensions. 

Finally,  we  note  the  over-all  success  of  the  model  in 
predicting  the  confusion  matrices  over  the  three  conditions 
of  the  identification  task  for  each  of  the  two  groups  of 
observers.  Averaged  over  the  18  comparisons  of  predicted 
and  observed  confusion  matrices  (Table  2),  the  model 
accounted  for  91  percent  of  the  observed  variance. 
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